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SEQUENTIAL JOINT DETECTION AND ESTIMATION 

By Yasin Yilmaz*, George V. MousTAKiDEst , and Xiaodong Wang* 

Abstract. We consider the problem of simultaneous detection and esti- 
mation under a sequential framework. In particular we are interested in se- 
quential tests that distinguish between the null and the alternative hypotheses 
and every time the decision is in favor of the alternative hypothesis they pro- 
vide an estimate of a random parameter. As we demonstrate with our anal- 
ysis treating the two subproblems separately with the corresponding optimal 
strategies does not result in the best possible performance. To enjoy optimal- 
ity one needs to take into account the optimum estimator during the hypoth- 
esis testing phase. 

1. Introduction. Suppose we are observing sequentially two processes {yt}, 
{ht} which are related through the following model 

yt = xht + wt] t = l,2, 

Process {wj } is a noise sequence; x is a random variable described by the following 
two hypotheses 
Ho : x = 0, 

Hi : X ~ ip{x), 

where x ~ ip{x) means that the random variable x follows the pdf ip{x); and 
{ht} is a second obsei^ved process that affects in a time- varying and random way 
the value of the random variable x. In other words, under the null hypothesis the 
observed sequence {yt} is pure noise whereas under the alternative hypothesis it 
contains a mean which is related to the random parameter x and scaled through the 
second measured sequence {ht}. 

Sequences of this form arise in several applications in practice, the most notable 
being digital communications where x denotes the information to be transmitted. 
Under hypothesis Hq no transmission takes place, consequently the receiver mea- 
sures pure noise. Under hypothesis Hi information x is transmitted and the se- 
quence {ht} models the attenuation inflicted on this variable by a lossy and time- 
varying communication channel. We should mention that in digital communica- 
tions it is customary to assume that the channel sequence {ht} can be measured, 
consequently, assuming that this process is available, is realistic (see Proakis and 
Salehi (2008)). 
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The mathematical problem we would like to consider in this work consists: a) in 
deciding as soon as possible between the two hypotheses, and b) every time a 
decision is made in favor of Hi we would like to provide an estimate of the random 
variable x. As we realize, we have a joint detection and estimation problem where 
both subproblems are of equal importance. Indeed we note that we like to have a 
reliable estimate of x every time we detect its presence. 

Key element in our formulation constitutes the fact that we are interested in per- 
forming the joint detection/estimation task as soon as possible, suggesting that we 
intend to focus on sequential schemes to solve this challenging joint problem. Fi- 
nally, we would like to emphasize that our analysis is going to demonstrate that 
solving the joint problem by treating each subproblem separately with the corre- 
sponding optimal procedure does not yield an overall optimum performance. As 
we shall see, the detection part needs to take into account the fact that we are also 
interested in parameter estimation in order for the combined scheme to perform 
optimally. 

Sequential joint detection/estimation differs from sequential composite hypoth- 
esis testing where parameters are either mai^ginalized or treated as nuisance (see 
Lerche (1986), Pavlov (1990)). Actually, joint detection/estimation resembles more 
to sequential multi-hypothesis testing where there is a discrete set of possible prob- 
ability measures that describe the observations and we need to select one of the 
existing possibilities. Characteristic articles treating this problem are: Annitage 
(1950), Lorden (1977), Tartakovsky (1998) and Dragahn et al. (1999). The joint 
case studied in this work differs from the previous problem in the sense that we 
have a parametric family of measures (parametrized by x) and we need to select 
the correct parameter value, after establishing that this value is not 0. Existing lit- 
erature related to joint detection/estimation is very limited and addressing only the 
fixed sample size case. The articles by Middleton and Esposito (1968), Fredriksen 
et al. (1972), Moustakides (2011) and Moustakides et al. (2012), offer different 
formulation possibilities for the fixed sample size version. In the current work, we 
are focusing on the setup proposed by Moustakides (2011) and extend the corre- 
sponding result to the sequential case. 

Let us now become more technical by introducing the detection/estimation strate- 
gies we are interested in. Assuming that observations become available sequen- 
tially in pairs {(?/(, /i^)}, let {^t}t>o denote the corresponding filtration with J^t = 
a{{yi,hi), . . . ,{yt,ht)} and the trivial cr-algebra. We also define two ad- 
ditional filtration {'3^t}t>o and {J^t}t>o with = a{yi, . . . ,yt} and J^t = 
a{hi, . . . ,ht}, that is, the accumulated history pertinent to the first and second 
observed sequence respectively, and "S^OjJ^o being, again, trivial cj-algebras. We 
clearly have = ^ U J^, therefore J^t ^ -^t- 

According to what we mentioned, we are looking for a triplet (T, dr, xt) where 
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T is a stopping time, dx a decision rule that distinguishes between the two hy- 
potheses and XT an estimator for x. The detector dr and the estimator xt are 
^T-measurable functions, namely they use all available information acquired up 
to time of stopping T, for deciding between the two hypotheses and for providing 
an estimate for x every time this is deemed necessary (i.e. whenever the detector 
decides in favor of Hi). For the stopping time T, the obvious choice would be to 
ask this quantity to be {^j} -adapted, namely, at each time t to use all available 
information to decide whether to stop or continue sampling. Unfortunately, impos- 
ing this requirement induces serious analytical complications. This fact is already 
known for the two separate subproblems of detection and estimation. For instance 
if we assume that we always have yt = xht+wt and we are interested in estimating 
X then, as it is mentioned in Ghosh (1987) and in Ghosh and Sen (1991), finding the 
optimum sequential estimator of x is not a tractable problem if T is adapted to the 
complete observation history {^t}- Instead, Grambsch (1983) and more recently 
Fellouris (2012), proposed to limit T to {J^}-adapted strategies, assumption that 
leads to simple and interesting optimal solution. 

Similar analytical difficulties arise in the pure sequential hypothesis testing prob- 
lem of distinguishing between Hq and Hi. If we require T to be {^t}-adapted and 
attempt to solve this problem following, for example, the classical approach of 
Wald and Wolfowitch (1948), then the optimum scheme is not the usual SPRT as 
one would expect. This is because by observing the pair process {{yt,ht)} we end 
up with a two-dimensional optimal stopping problem which is impossible to solve 
(analytically) since the thresholds for the running likelihood ratio will depend on 
the sequence {/it}. Only if the sequence {/if} is constant, or not observed (or even 
ignored) and, additionally, we assume it is i.i.d. with known pdf, then the detection 
problem can be reduced to the one considered by Wald and Wolfowitz (1948), ac- 
cepting as solution the classical SPRT. In this case the stopping time T becomes 
{^}-adapted and the decision function dr must be selected to be ^^-measurable. 

An alternative idea would be to consider, as in the pure estimation problem, 
{=^}-adapted stopping times but, as we suggested above, allow the decision func- 
tion dx to have access to the complete information, that is, be ^T-measurable. 
This is the approach we intend to adopt in this work. In fact we are going to apply 
this idea directly to the more general joint detection/estimation problem. As we 
shall see, our analysis will also offer the solution to the pure detection problem by 
proper parameter selection. Next we summarize our assumptions. 

ASSUMPTIONS: i) The two processes {wt}, {ht} are independent and indepen- 
dent from the random variable x with the noise process {wt} being i.i.d. with 
Wt ~ o/K(0, fj^), where ^(a, 6^) denotes Gaussian pdf with mean a and vari- 
ance 6^. ii) For x, under Hi, we assume that x ~ ^(/i^;, o"^), in other words the 
prior ip{x) is the Gaussian pdf; while under Hq we assume x = 0. Parameters 
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l^x,<^x,o' are considered known, iii) For the second observation process {hf} we 
only make the very mild assumption 



that is, with probability 1, each realization of this process has infinite energy over 
the infinite time horizon. No other condition is imposed on {/it}, consequently, 
this process can have any form of dependency and time variability, with the actual 
statistical description not required to be known, iv) The stopping time T is {Mt}- 
adapted while the decision function dx and the estimator xt are J^x-measurable 
functions having access to the complete observed history up to the time of stopping. 

In the rest of our article, with Pq , Eq we denote probability measure and expec- 
tation under hypothesis Hq; with Pi, Ei probability measure and expectation under 
hypothesis Hi including the statistical description of the random variable x and, 
finally, with Pi, Ei probability measure and expectation under hypothesis Hi but 
with X being mai^ginalized. 

2. Problem formulation. In sequential detection and estimation we are usu- 
ally interested in minimizing the average delay subject to suitable constraints. 
However, in order to free our formulation from the need to specify a probability 
measure for the process {ht}, we will adopt the same idea employed in sequen- 
tial estimation, namely consider expected delays, error probabilities and average 
costs conditioned on the sequence {ht}. This approach will give rise to a triplet 
(T, dT,XT) which will be optimum for each reaUzation of {/it} and not on average 
with respect to this sequence, as is the usual case in classical Sequential Analysis. 

Since we are interested in the two subproblems of detection and estimation we 
have a number of quantities that are of interest in each case. For the detection part 
we have the Type-I and Type-II error probabilities that have the following condi- 
tional form: Po(c?t = ^J^t) and Pi {dx = 0\J^). For the estimation problem we 
assume that we are under hypothesis Hi and we adopt as cost function the mean 
squared enor. We recall that our estimate depends on the decision of our detector, 
in particular: whenever = 1 we provide an estimate xt which inflicts a squared 
estimation error {xt — x^, where x is the true value of our random parameter. Al- 
ternatively, when the detector en^oneously decides in favor of Hq, that is, = 0, 
then this is like estimating our parameter as xt = (since under Hq we have 
a; = 0) generating a squared error (0 — x)^ = x^. Consequently, for the estimation 
subproblem there are the following two conditional mean squared errors that are 

of interest: Ei[(xt — x)'^1^iIt=i}\'^] ^^'^ ^i[x^^{dT=o}\'^T], where 1a denotes 
the indicator of the event A. 



(1) 
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We can now use these four quantities to form the following combined cost func- 
tion 

(2) ^(r,dT,XT) = CoPo(dT = l\.^T)+CiPi{dT = 0|^t) 

where co,ci,Ce are nonnegative values selected by the Statistician. The last temi 
in the right hand side of (2), which refers to the estimation problem, as we can see, 
depends on both parts, namely our decision and our estimation strategy. Further- 
more, we note that if we set Ce = then the combined cost depends only on the 
decision rule dx suggesting that our joint problem is reduced into a pure detection 
problem. 

To define an optimum joint scheme, we will follow a constrained optimization 
approach, therefore we are going to consider triplets (T, drj^r) for which the 
combined cost '^(T, dx, xt) is upper bounded by some prescribed quantity. From 
the class of triplet strategies which is generated through this constraint we will 
select the one that minimizes the stopping time T. More specifically we would like 
to solve the following constrained optimization problem: 

(3) inf_ T; subject to: '^(T, dr, £t) < C*, 

where C > is the a maximal allowable combined cost we are willing to tolerate. 
Note that since T is { J^}-adapted, as we mentioned before and would like to 
emphasize once more, the triplet we are going to develop will minimize T for 
each realization of the process {/ij} and not E[T], where the average is taken over 
all reahzations of this process, as is the usual case in classical optimal stopping 
problems. 

3. Optimum solution. The optimum triplet will be obtained in three steps. 
First we will propose a candidate estimator by solving a smaller auxiliary opti- 
mization problem, then we are going to use this solution to propose a candidate 
detector that takes into account the previous estimator by solving a second auxil- 
iary optimization problem and, in the end, we will provide a candidate stopping 
time and show that all three proposed parts constitute the triplet that solves the 
original constrained optimization problem depicted in (3). Let us continue by first 
identifying our candidate estimator. 

3.1. Optimum estimation. Fix the stopping time T assuming that it is finite 
with probability 1 and the decision function dx- Consider the problem of minimiz- 
ing the conditional mean squared eixor Ei[(xt — 3;)^l{dT=i}l=^^] ^i'^h respect to 
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the estimator xt- We have the following lemma that gives the solution to this prob- 
lem and also provides a useful expression for the second term Ei[x^l|^j,=o}l=^] 
of the estimation cost. 

Lemma 1. The optimum estimator xt that minimizes Ei [(xt— | 
is given by the following formula 



T 



T 



(4) 



XT = — where Vr = 22 Vt^t'^ = 22 ^* ' 

t=i t=i 



Ut + 



while the corresponding minimum conditional mean squared error takes the form 



(5) inf Ei[(xT - xfl{d^=^}\Mr] = 77^^ 



Pi {dr = \\^t) . 



Additionally we can write 
(6) El [x21|^^^o}I^t] 



El 



a 



2\ 2 



1 



{dT=0} 



1!^ 



+ 



-Pi ((iT = 0|^). 



Proof. The proof is simple and based on the well known result that the mean 
squared error is minimized by the conditional mean of x given all available ob- 
servation history. The interesting detail is that this result is still valid even if the 
observation history is dictated by an {^}-adapted stopping time T and an J^t- 
measurable decision rule dr- To demonstrate (5), using that T is {J^}-adapted, 
dr is ^5"-adapted and J^t ^ ^t, we can write 



E^[{xT - xflfaT=i}\'^T] 



xfl{at=l}'^{T=t}\M 



J2^i [{xt - x)H{a,=i}\jn] i{T=t} 
00 

^ El [El [{xt - xf\^t] l{d,=i}\M] l{T=t}, 



t=i 



t=i 



where for the last equality we used the tower property of expectation. From classi- 
cal estimation theory (e.g. Poor 1994, page 151) we know that 
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This minimal value is attained by the conditional expectation xj = E[x|^(] which, 

2 2 

due to the fact that x, given ^t, is Gaussian with mean (Vt + ^ix^)/ {Ut + ^) and 
variance cj^/ {Ut + is equal to 



x'ZT 



Ut + 



Consequently, since Ut is ^-measurable we deduce 

,2 



a 



Ut + 



-^l{dt=i}\'^t 



OO 9 



t=i + ^ + ^ 

with equality, as we mentioned, when xt = y~T- 
To prove (6), we can write 



L{T=t} 



Pl(dT = 1|^t), 



(7) El [x^I^.^^o^IjTt] = 2^ El [x^l^^^^o^lJ^] 1^^^,} 

OO 

= J] El [El [x\^t] M=o}\M] l{T=t}- 
t=l 

Using again, as we mentioned above, the fact that x conditioned on ^t is Gaussian, 
we can compute 

2 



El [x\^t] 



Vt + Hx^ 



+ 



a 



2 TT M 



Substituting this equality in (7) and recalUng that Ut is ^-measurable, yields the 
desired result. □ 

3.2. Optimum detection. If we consider the combined cost "^(T, dx, x-p) where 
the estimator xt is replaced by the optimum estimator x^ defined in (4) then, using 
(5) and (6) we obtain 



^(r,dT,XT) = coPo(c?T = l\Mr) + ciPi[dT = ^\Mr) 

,2\ 2 



+ 



a 

'■r~2 j ^{dT=0} 



2^ 



+ Ce 
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Due to the fact that xt is the result of the minimization stated in Lemma 1 we have 
'io{T,dT,XT) < '^{T,dT,XT)- We note that the last term in the expression for 
'^(r, dT,'><-T) does not depend on the decision function dy, therefore, let us con- 
sider the sum of the first three terms of the right hand side and define the auxiliary 
cost 



(8) "^{T^dr) = coPoidr = 1|^t) + ciPi^dr = OlM^) 



+ 



El 



Vt + 



a 



2\ 2 



1 



{dT=0} 



1!^ 



In the sequel our goal is, for fixed T, to identify the decision function d^^ that 
minimizes '^(T, d^) with respect to dj^. The solution to this problem is given in 
the next lemma. 

Lemma 2. The decision function that minimizes the auxiliary cost function 
'^(T, cZt) with respect to d^, is given by the following formula 



(9) 



1 if Co <It {ci + c, 
otherwise, 



where Lt is the conditional likelihood ratio of {yi, . . . ,yt} of the two hypotheses 
given J^t, with the random variable x under Hi being marginalized, specifically 



(10) 





2 


2^2 







The resulting minimum value of the auxiliary cost function takes the form 



(11) infr (T,(iT) 

dT 



Co - Lt < 



Ci + Ce 




T 



+ Cl 



2 ^xUt 
Ce{l4 + 



where z = minjz, 0}. 
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Proof. The proof of this lemma presents no special difficulty. We can write 

oo 

(12) Po(dT = l|^r) = Yl Eo[lH=i}l^]l{r=i} 

t=i 

oo 

(13) Pi{dT = 0|^t) = Yl Eo[Ul{d,=o}l=^]l{T=t}. 
Similarly we have 



t=i 



(14) 



0^ 



t=l 



1 



{dt=0} 



L{T=i}' 



where we used the fact that Ut is ^-measurable. Substituting (12),(13),(14), in 
the definition of the auxiliary cost '^(T, dx) in (8) we obtain 



^{T,dT) 



t=i 



EE 

=1 

oo 

EEo 



Col{df=l} + U < 



Ci + Ce 



1 



{dt=Q} 



0^ 



t=l 



Co - Lj < Ci + Ce 



> 1 



L{T=t} 



L{T=t} 



lt{ci+ Ce 



Vt + 



oo 

>EEo 



Co - Lj < 



Cl + Ce 



L{T=t} 



2^ 



L{T=t} 



Cl + Ce 



1^ 



1 



{T=t\- 



We can easily verify that we have equality when the decision function is according 
to (9). The last term in the previous inequality can be shown to be equal to c\ + 
Ce{/x^ + alUx/iUT + fr)} by observing that on the event {T = t} we have that 
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Vt, under Hi and conditioned on J^, is Gaussian with mean fixUt and variance 
alUi + a^Ut = alUt{Ut + ^). 

To show the vaUdity of (10) we have that the likeUhood ratio of {yi, . . . , yt} for 

2 

the two hypotheses, given x and J^t, is equal to exp(— ^[/^ + -^iVt)- Marginal- 
izing X using the Gaussian prior yields which is therefore computed as 



e 2ct- 



dx. 



Combining the two exponents and "completing the square" for x, it is straightfor- 
ward to prove (10). □ 

From (9) when we set Cg = 0, we end up with the pure detection problem, and 
the optimum detector reduces to the usual likelihood ratio test which is applied 
at the time of stopping T. However, when Cg > 0, in the detection rule we take 
into account the estimation part, and our detector is no longer a likelihood ratio test 
since it also contains the corresponding estimation cost. Actually, this is exactly the 
point that discriminates our optimum joint detection/estimation scheme from the 
approach that solves the two problems separately by applying the corresponding 
optimum strategies. Note that the latter method would have simply applied the 
likelihood ratio test for detection and then the optimum estimator whenever the 
decision was in favor of Hi. Our scheme on the other hand makes a decision by 
taking into account the square of the optimum estimate. 

3.3. Optimum stopping time. Using the results of Lemma 2, in particular sub- 
stituting (11) in the combined cost function we obtain 



(15) ^(r,dr,XT) 



Co 



-T < 



Ci + Ce 



Vr + iicc 



Ut + 




2^ 



+ Ci + Ce(/i^ + (j: 



From the way dr, were defined, we clearly deduce that any triplet (T, cLt^xt) 
satisfies the following inequality 



(16) 



^{T,6T,XT)<^{T,dT,XT). 



Let us now make a more explicit computation of the conditional expectation ap- 
pearing in (15). For this reason, in the next lemma we define a suitable function 
([/) for which we also prove a monotonicity property that plays a crucial role 
in specifying the final term of our triplet, namely the optimum stopping time. The 
lemma is based on the observation that on the event {T = t} and given Ml, we 
have Ut known and, under Hq, Vt ~ ^(0, a'^Ut). 
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Lemma 3. Let U > and consider V to be a Gaussian random variable with 
V ~ ^(0, o"^[/). Define the following function 



(17) '^{U) 
( 



r 

J — c 



Co 



— e 









20-2 







u + K 



Cl + Ce 



771 



u + K 



e 2cr'''U 



dV. 



\ 

Then ^{U) is continuous, strictly decreasing in U > 0, with ^(0) 
and lim;7^oo '^{U) = — ci — Ce(/i^ + a"^). 



(CQ - Cl 



Proof. Because the proof is very technical, we will not present all computa- 
tional details. That ^(f7) is continuous it is obvious since the integrand is contin- 
uous in U and V. Let us now prove the desired monotonicity property of '^{U). 

2 

Call for simplicity k = ^ and define the function G{U,V) 



(18) G{U,V) 



Co 



U + K 



2 
2ct2 



Cl + Cf 



co- 



Cl + Ce 



Denote with g{U) the solution of the equation 



V + /^a^ 



\/27ro-2c/ 



g 2ct^(7{(7+k) 



(19) 



cq 



g2o-^(i7+K) 2o-^ 



Cl + Ce 



where g replaces {V + ^xf^)"^- Even though the latter quantity is nonnegative we 
allow g to take upon negative values as well so that (19) has always a solution. 
Note that the right hand side in (19) is strictiy decreasing in [/ > and strictly 
increasing in g. For fixed U if we set (7 = — ([/ + Kf'cxjcf^, the right hand side 
becomes 0. If on the other hand we let (7 — )• 00 then the right hand side tends to 00 
as well. Due to continuity and strict increase in g there is a unique solution g{lJ\ 
Using g({7) we can now deduce that the values of V for which the integrand 
in (17) and therefore G(\J,V) is nonpositive is y G TiJJ) = {-oo,-Vi{U)] U 
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[V2{U),oo) where Vi{U) = y^^+(U) + fM,K, V2iU) = ./^+^-ii^k, and z+ = 
maxjz, 0}. Note that for values of U for which g{U) < we have —Vi{U) = 
V2(^7) = — Aia:^> therefore both quantities coincide. When, however, g{U) > 
then G{U, -Vi{U)) = G{U, V2{U)) = 0. Using the previous definitions and ob- 
sei^vations we can write W{U) as follows 



(20) ^(C/) 



-Vi{U) roo 

G{U,V)dV+ / G{U,V)dV 

X) JV2{U) 

/oo 
G{U,V)lriu){V)dV. 



To show that ^{U) is decreasing, it suffices to show that its derivative is nega- 
tive. Consider first values of U for which the solution to (19) satisfies g{U) < 0. 
In this case, as we mentioned, we have — Vi{U) = V2{U) = —fix^ suggesting that 
T{U) becomes the whole real line. Thus, substituting (18) in (20), we can write 



f{U) 



G{U,V)dv] = (co-ci-Ce\fil+ ""'^ 



U+K 



<0, 



and, therefore, W{U) is strictly decreasing for all [/ > for which g{U) < 0. 

Let now U be such that the solution to (19) satisfies g{U) > 0. Substituting 
again (18) in (20) and changing variables z = V/ VU we have 



-Vi{U) 



G{U,z)dz+ / G{U,z)dz 

'V2{U) 



G{U,z)dz 



T 



where 



G{U, z) = VUG{U, zVU) 



Co 



V2 



Cl + Ce 



U + K 



_ {Z-Ma;%/C7)2 



and T{U) = (-oo, -VijC/)] ^[V2{U),oo) with Vi{U) = Vi{U)/VU; i = 1,2. 
As before it is true that G{U, -Vi{U)) = G{U, V2{U)) = 0. Taking the derivative 
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with respect to U yields 

^'{U) = -G{U, -Vi{U))Vl{U) - G{U,V2{U))mU) + [ duG{U,z)dz 

Jt{U) 



[_ du 



( 



Cl 



duG{U,z)dz 

T(U) 

U + fixK 



U + K 



dz. 



The latter integral after some tedious mathematical manipulations can be computed 
explicitly yielding 

'^'{u) = ^^y^ n{u) 

2{U + K)y/27ra^U{U + k) 



{U + KY^j2alTTU{U + k) V2<t2k(C/ + k) 



+ 1 n{u). 



where 



r2([7) = e 2alu(U + ^) _|_ g 2aiu{U + K) 

We realize that all parts involving ci and Cg are negative, suggesting that the func- 
tion ^(f/) is strictly decreasing. This is still tme even if we limit ourselves to the 
pure detection problem by enforcing Cg = 0. 

To conclude our proof we need to show the validity of the formulas for ^(0) and 
limc/_>oo '^{U). Consider first [7-^0. Since V ~ ^(0, a'^U), we have that the 
pdf of V tends to a Dirac function at 1/ = 0. For this case it is straightforward to 
verify the expression for ^(0). Computing the limit for [/ — )• oo needs more work. 
Note first that the solution g{U) of equation (19), for large U , can be expressed 
in order of magnitude as g{U) = Q{U\ogU). This means that we can find two 
positive constants ai, 02 independent of U such that, for large enough U , we have 
oiC/ log U < g{U) < log U. That this is indeed possible, can be readily seen 
because, for sufficiently large U, we have g{U) > and U + k > 1, therefore we 
can upper and lower bound g{U) from (19) by observing that 

Cl < Cl + Ce—-^—-7T < Cl + 2CeO-^ ^ 



■(C/ + k)2 - ' ' 2(j2([/ + k) 

2t , 9 



< max{ci,2ceCT"} ( 1 + ^^2(^^^n ) ^ max{ci, 2ceCj2}e^^^(^ 
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These two bounds generate, immediately, the corresponding desired upper and 
lower bounds for g{U). A direct consequence of the order of magnitude estimate 
of g{U) is that, since Vi{U) = ^/gMU) + fixi^ and V2{U) = sj g^iU) — fixi^, we 
have that Vi{U),V2{U) are both Q{y/UlogU). Using (18) and (20) to compute 
we can see that the first temi involving cq is 



Co 



u ) 



a 



V2{U) \ 

v) 



a 



since Vi{U)/^/U — )• oo, where $(2;) denotes the standard Gaussian cdf. In the 
second term involving ci, Cg, let us make the change of variable z = — ^"^""^ , 

then we can write 



T{U) 



Ci + Ce 



y + f^xi^- 

U + K 



2a'iU(U + K) 



y/2TTa'^U{U- 



-.dV 



T(U) 



U 



U + K 



where we recall T{U) = (-00, -Vi{U)] U [V2{U),oo) and we define f{U) = 
(-oo,-yi(C/)] U [V2(f/),oo) with Vi jU) = jV ^jU) + fixU)/p{U), V2{U) = 
{V2{U)-fixU) /p{U) and p(U) = (Tx-\/U{U + k). Note in the last integral that the 
integrand is nonnegative. Furthermore integration over T(C/) can be regarded as 
integration over the whole real line after multiplying the integrand by the indicator 
function of the set 'T{U). Because the indicator is nonnegative and upper bounded 
by 1 and [px + zax\/U / {U + k)]^ < 2(/x^. + z^cr^,), we can upper bound the 
integrand by a function which does not involve U and is integrable. This suggests 
that we can apply Bounded Convergence. Combining this with the observation that 
—Vi{U) — —Hx/<^x and V2{U) — —px/<^x, meaning that 'T{U) tends to the 
whole real Une or l^^^j(z) — 1, we deduce 



lim 

U^oo 




z)^—=dz 



Cl + Cg {fix + ZUx 



e 2 



dz = ci + Ceinl + al), 



which is the desired expression. This concludes the proof of our lemma. 
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The function W{U) introduced in Lemma 3 is very important and will simplify, 
considerably, the representation of the combined cost '^{T, dr, xy). Indeed, by re- 
calling the definition of Lj from (10), we can identify the conditional expectation 
appearing in (15) as '^{Ut)- This means that in "^(T, d^, xt) if we replace xt, dr 
with their optimum counterparts x^, then we have the following simple expres- 
sion for the resulting combined cost 

(21) ^{T,dT,^T)=^{UT) + Cl+Ce{^^l+al). 

We are now in a position to reveal the optimum stopping time and finalize the 
desired triplet that solves the constrained optimization problem introduced in (3). 
The next theorem presents the complete solution. 

Theorem 1. In the constraint in (3), let the maximal allowable cost C satisfy 
min{co,ci + Ce/i^-} + CeC^ > C > 0, then the optimum triplet (T, djjXj) that 
solves the corresponding constrained optimization problem is: 

(22) T = inf{t > : C/t > 7}, 
where threshold 7 > the solution of the equation 

(23) ^{^) = C -c^-ce{^ll + al), 

and the other two elements of the optimum triplet are given by {9) for the optimum 
detector and (4) for the optimum estimator with T replaced by T in both cases^. 

Proof. First note that when min{co,ci + Cg/x^} + Cgcr^ > C > 0, then 
C — ci — Ce(/i^ + o"^) takes values in the interior of the interval defined by the 
maximum $f(0) and the minimum value limu^oo'^{U) of the function W{U). 
Consequently, because of the strict monotonicity and continuity of ^{U), equation 
(23) has always a positive solution 7 > which is unique. Given that Uq = 0; 
Ut = X^tt=i is increasing; and by assumption Wvnt^oo Ut = 00 with probabil- 
ity 1, we also conclude that the stopping time T defined in (22) is almost surely 
finite. 

Let us now show the desired optimality of the proposed triplet. Consider any 
alternative triplet {T,dT,XT) that satisfies the constraint C > 'io{T,dT,XT)- Be- 
cause of (16) and (21) we conclude 

C>^(r,dT,XT) >^(r,dT,XT) =S^(^7T)+Ci+Ce(/i^+CT2). 

'Actually this simply suggests that the detector and the estimator need to be applied at the stop- 
ping time T. 
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The previous inequality combined with (23) suggests that 

^(C/T)<C-Ci-Ce(/i^+a2)=^(7) 

which, due to the strict decrease of ^{U), implies Ut > 7- From the latter we 
deduce that T > T since, by definition, T is the smallest time instant for which this 
inequality can hold. This establishes the optimality of the triplet (T, dy, xj). □ 

Remark 1 : For our theorem we must also add that if C > min{co, ci +Ce/i^} + 
CgfT^ then we can verify that the optimum stopping time is T = (no observations 
are needed) and the optimum joint detection/estimation structure depends, solely, 
on prior information. In paiticular if cq < ci + Cg/^t^ then we decide in favor of Hi 
and provide as estimate the mean, that is, xq = fi^; whereas if cq > ci + Cgfi^ then 
we decide in favor of Hq and, of course, there is no need for any estimate. 

Remark 2: Our theorem suggests that the optimal time to stop is when the 
running energy {Ut} of the process {ht} exceeds the threshold 7 for the first time. 
This will happen with probability 1, due to our original assumption (1). This is the 
only requirement imposed on {ht} while no additional prior information is needed 
regarding this observed process. As far as thi^eshold 7 is concerned, it is clear that 
the solution to equation (23) can be computed numerically. 

Remark 3: As we mentioned earlier, if we select Cg = then our joint setup 
reduces to a pure detection problem. What is interesting in our formulation is that 
the optimum stopping time T is still defined through (22) while the optimum de- 
cision function dj becomes a likelihood ratio test where Lj is compared against 
the threshold ^. This is in contrast with SPRT where, as we recall, we have a run- 
ning likelihood ratio compared against two (time-varying and dependent on {ht}) 
thresholds that are not possible to compute analytically. Furthermore, SPRT is op- 
timum only when the observations are i.i.d. whereas our simple scheme enjoys 
optimality even if the process {ht} is dependent and time varying with unknown 
statistic. 
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